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<N ; Abstract 
> 

Tij- ' An extension of quantum mechanics to a generalized Nambu dynamics leads 

• to a new version of nonlinear quantum mechanics. The time evolution of 

\o 

• states is given in here by a triple bracket generalization of the Liouville-von 
Neumann equation, where one of the generators is an average energy, and 

Q^' the other is a measure of entropy. A nonlinear evolution can occur only for 

' mixed states, and for systems that are described by Renyi a-entropies with 

' a ^ 2. The case a = 2 corresponds to ordinary, linear quantum mechanics. 

' Since a = 2 entropy is the only entropy characterizing systems which cannot 

gain information, the nonlinear dynamics corresponds to "observers", that 

is, systems that can gain information. The new formulation of nonlinear 

quantum mechanics is free from difficulties found in earlier attempts. The 

connection of linearity with possibilities of gaining information is in a striking 

agreement with the ideas of Wigner formulated in his paradox of a friend. 
03.65BZ, Ca 
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I. INTRODUCTION 



In the first part of this paper |l|] I have described the fundamental theoretical difficulties 
of nonlinear quantum mechanics (NLQM) based on a nonlinear Schrodinger equation. In the 
present paper I will present a different generalization of quantum mechanics (QM) where the 
nonlinear evolution never occurs for pure states (the Schrodinger equation is hence always 
linear in this framework) but, instead, may appear, under some circumstances, for mixed 
states. We will see that the new approach will be free from the difficulties discussed in Ref. 

The proposed generalization is based on the idea of rewriting the Liouville-von Neumann 
equation in a triple bracket form, introduced by Bialynicki-Birula and Morrison [§. The 
triple bracket is an infinite dimensional analog of the Nambu bracket where, as opposed 
to the structure constants ekim of the rotation algebra appearing in the original Nambu 
bracket, the structure constants correspond to some infinite-dimensional Lie algebra. In the 
original Nambu paper an evolution of a physical system (a rigid rotator) is generated by 
two "Hamiltonian functions", the energy H and J, where the latter is the Casimir of so{3) 
(squared angular momentum). The metric tensor used for constructing the Casimir is, as 
usual, the one related to the Killing form In the triple bracket formulation of QM the 
analog of J is the Casimir S = l/2Tr(p^) which also can be written as g'^^PaPh although, as 
we shall see later, the metric g""^ is no longer given by the Killing tensor (which does not 
exist in this case). The Casimir S was termed in Ref. the entropy. It will be argued 
below that the assignment of the name "entropy" to S should not be regarded as accidental, 
but as a refiection of a deeper principle relating dynamics with information. 

The fact that a kind of such a relationship should be present in QM follows already 
from the Copenhagen interpretation of a measurement (reduction of a state vector), but our 
approach will be essentially different and closer in spirit to Wigner's paradox of a friend 
0]. Let me recall that Wigner, in order to solve the paradox, concluded that a conscious 
observation must be accompanied by a nonlinear evolution in the space of the observer's 
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states. Even though the argumentation of Wigner looks convincing, it seems that standard 
quantum theories do not leave room for a physical principle of that kind. It is surprising that 
the triple bracket formalism does lead quite naturally to this phenomenon, if we seriously 
treat the intuitions of Bialynicki-Birula and Morrison that S is a, measure of quantum 
entropy. 

Putting things more modestly, we can say that the results of this paper, even if their 
interpretation will turn out inadequate, show that the structure of quantum dynamics may 
be a part of a more general, nonlinear framework. 

II. MEASURES OF INFORMATION AND QUANTUM MECHANICS 

A logarithmic measure of information was introduced by R. V. Hartley in 1928 0. 
According to him, to characterize an element of a set of size N we need log2 N units of 
information. It follows that a unit of information (1 bit) is the amount of information 
necessary for a characterization of a pair. Of course, one can choose also other units such 
that the unit is the amount of information necessary for a characterization of a set with 
< /c G N elements, or even with < r G R elements in average. The respective measures 
of information in arbitrary units a are log„ A^. The most important feature of the logarithmic 
information measure is its additivity: If a set Eisa, disjoint union of M A^-tuples . . . , Em-, 
then we can specify an element of this MA^-element set E in two steps: First we need log^ M 
units of information to describe which Ek of the sets . . . , Em contains the element, then 
we need log^ N further units to tell which element of this Ek is the considered one. The 
information necessary for a characterization of an element of E is the sum of the partial 
informations: log^j MN = log^ M + log^j A^. Next step in the developement of the measures 
of information was done independently by C. E. Shannon ||^ and N. Wiener [^] in 1948 who 
derived a formula analogous to Boltzman's entropy. Their formula has the following heuristic 
motivation. Let E be the disjoint union of the sets Ei, . . . , En having A^^i, . . . , Nn elements 
respectively (J2k=i^k = N). Let us suppose that we are interested only in knowing the 
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subset Ek- (This is typical for classical statistical problems in physics: Statistical quantities 
depend on classes of microscopic conditions and not on single microscopic properties.) The 
information characterizing an element of E consists of two parts: The first specifies the 
subset Ek containing this particular element and the second locates it within Ek- The 
amount of the second piece of information is, by Hartley formula, log^ A^^ thus depends 
on the index k. On the other hand, to specify an element of E we need log^ units of 
information. The amount necessary for the specification of the set Ek is therefore 

1 

4 = log„ N - log, Nk = log, — = log, -. (1) 

^Vfc Pk 

It follows that the amount of information received by learning that a single event of proba- 
bility p took place equals 

/(p) = log.^- (2) 

In statistical situations measured quantities correspond to averages of random variables. 
Therefore the average information is 

/ = 5]pfclog,-3-. (3) 

k 

This is the Shannon's formula and / is called the entropy of the probability distribution 
{pi, . . . ,p„}. If all the probabilities are equal then the Shannon's formula is equal to 
the Hartley's one. The mean we have applied is the so-called linear mean. Renyi observed 
that there exist information theoretic problems where the measures of information are those 
obtained by more general ways of averaging — the Kolmogorov-Nagumo function approach 
ip. Let be a monotonic function on real numbers. The Kolmogorov-Nagumo average 
information can be defined by means of as 



(4) 



If the generalized information measure is to satisfy the postulate of additivity, ip must be a 
linear or exponential function. The linear function corresponds to Shannon's information. 



The exponential functions provide a large class of new measures of information. Consider a 
function ^p{x) = a'-^""-*^. We can always choose the units of information in such a way that 

For pk = 1/N we obtain again the Hartley formula. Formula @ describes Renyi's a-entropy 
which, from now on, will be denoted /^('P), where V denotes the probability distribution. 
We see that the essential part of the definition is played by 

/:(P) = a^"(^) = (EK)''^""^ (6) 

k 

which is independent of the choice of the unit a. To distinguish between a-entropy and /^('P) 
we shall call the latter a*-entropy (* will remind us that this quantity is multiplicative in 
opposition to the additivity of Ia{V)). (The observation that what is in fact informationally 
fundamental in laiV) is I^i'P) is strenghtened by Daroczy's definition of entropy of order a 
p!0| defined as 

{2'---l)-^(j2pt-l). (7) 

k 

This expression possesses many ordinary properties of the entropy and in the limit a ^ 1 
becomes, the so-called Shannon's information function.) 

The limit a — ^> 1 is interesting also for a-entropies. It can be shown that /i = lima^i 
equals Shannon's entropy. 

Ia{V) is a monotonic, decreasing function of a. For negative a Ia{V) tends to infinity 
if one of pk tends to zero. This property excludes a < because adding a new event of 
probability to a probability distribution, what does not change the probability distribution, 
turns lai'P) into infinity. 

A fundamental notion in information theory is the gain of information. Consider an 
experiment whose results are Ai, . . . , An having probabilities pk = P{A = Ak). We observe 
an event B related to the experiment and obtain a result B = Bi. Now the conditional 
probabilities are pki = P{A = Ak\B = Bi). Consider now a system (an "observer") whose 
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information is measured by some a-entropy. How much information about the random 
variable A has he received by observation oi B = Bil The amount of information he would 
have obtained by observing A = Ak would be equal to 

log. - (8) 
Pk 

if he had not measured B. After having observed B = Bi the amount of information he 
would have obtained by observing A = A^ would be 

log„-^. (9) 

Pki 

It follows that the measurement oi B = Bi has given him already 

loga — - loga — = loga — (10) 

Pk Pkl Pkl 



units of information about A. The expression (|ToD is called the decrease of uncertainty about 



A = A^hy observing B = Bi. We define the gain of information about A, obtained when 
the probability distribution {p^} is replaced by {pki}, by 

If we define the increase of the uncertainty by minus decrease of uncertainty we can calculate 
the average "loss of information" defined by 

-i?''-K^))-T^>-.(i:#T)- (-) 

For Shannon's entropy the gain is minus the loss. For a-entropies the two concepts are 
inequivalent. 

The gain of information defined by ( pJ]) for a > 2 has the same pathological properties as 
la for a < so, it seems, cannot be consistently applied unless we restrict < a; < 2. This 
is the reason why Renyi defined the gain of information as minus the loss, although such a 
definition is less netural. From the viewpoint of our quantum mechanical applications the 
situation is not so clear, however, and the following argument shows that a = 2 is a natural 
value limiting a-s from above. 
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When we speak about information, what we have in mind is not the subjective "infor- 
mation" possessed by a particular, animate observer. In reahty the information contained 
in an observation is a quantity independent of the fact whether it does or does not reach 
the perception of the observer (be it a man, some registering device, a computer, or some 
other physical system). On the other hand, different kinds of entropies introduced above 
may be characteristic for different systems. The entropy (information) is objective in the 
same sense as probability, and in the same sense it is reasonable to expect that there are 
classical and quantum informations, as there are classical and quantum probabilities. 

The procedure leading to the notion of the decrease of uncertainty assumes implicitly 
that after each measurement of a random variable, here B, one can always proceed further in 
getting information about A, and that the procedure terminates when we know everything 
about the state of the system. In classical world this final state of knowledge means no 
uncertainties. Therefore, classically, if there is some lack of knowledge about a system, then 
there exists, in principle, a possibility of gaining information. Putting it more formally, 
we can say that an information characterizing a classical system should allow for different 
gains of information in different situations. The quantum mechanical no-hidden-variables 
postulate means that the probabilistic description of a quantum system does not follow 
from our lack of knowledge about the system. This suggests that a quantum information, 
characterizing a quantum system, might be of such a kind that its corresponding gain of 
information is zero under all circumstances. It is tempting to develop this hypothesis a little 
and find whether a measure of information possessing this property exists. 

The Shannon's information gain is given by 

-J^p^log^— (13) 
V Pk 

and vanishes only if A and B are independent. So this case can be excluded because we 
want the gain of information to be for all probability distributions (this excludes also the 
von Neumann entropy). For a-entropies we find that the vanishing of (|TT]) implies 
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Pkl 



(14) 




l-a 



k 



which can hold for all pk and pki if and only if a = 2. It follows that the only candidate for 
the quantum entropy is the Renyi's 2-entropy which reads 



k 

Expressing the probabilities by means of a density matrix and choosing the unit of informa- 
tion with a = e we obtain 



This kind of entropy is sometimes considered as an alternative to von Neumann's entropy 
| rT| . Our reasoning, based on the assumption that an ordinary quantum system should not 
have a possibility of gaining information, selects this entropy in a unique way. It is clear, 
from the perspective of the Wigner's paradox of a friend, that observers, who can gain 
information, should be described by a 7^ 2-entropies. 

III. POISSONIAN FORMULATION OF QUANTUM MECHANICS 

A departure point for the discussed generalization of linear QM is the observation that 
quantum theory can be regarded as a particular classical infinite dimensional Hamiltonian, 
Poissonian or Nambu-like theory. 

Let 7i be a Hilbert space. Consider the Hamilton equations 



and c.c, where the bars denote complex conjugations and the conventions concerning primed 
and unprimed indices are assumed like in the spinor abstract index calculus |T^. The 
summation convention is as follows: We sum over repeated Roman indices and integrate 
over repeated Greek ones. The integration is with respect to some invariant, or quasi- 
invariant measure on a finite dimensional manifold (mass hyperboloid, spacelike hyperplane 




(15) 




(16) 




(17) 
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in the Minkowski space, etc.). The symbol of the "proper time" derivative describes a 
differentiation with respect to a suitable foliation of space-time (Minkowskian spacelike, or 
Galilean t =const hyperplanes, etc., see Appendix). In Hilbertian formulation of QM the 
"symplectic form" is given by the delta distribution 

uj'^^'ia, a') := a') =: uj^^'6{a, a') (18) 

where 6"^^' = 6ab' = 1 ii A = B' and for A ^ B' in the nonrelativistic QM. For the 
Dirac equation 5"^^' and 5ab' can be represented by the Dirac matrix 70, and the Dirac 
delta function must correspond to the choice of the spacelike hyperplane. In the projective 
space formulation the symplectic form corresponds to the Fubini-Study metric. The inverse 

of u;^^'(q;, a') is 

lAA'{ci,Ci') '■= —iSAA'^{ci,Ci') ='■ ^AA'^ici, Ci') (19) 

where by the inverse we understand that 

uj^^'{a,a')lBA'{f3,a') = SiS{a,f]) (20) 
u;^^'{a,a')lAB'{a,(3') = S^^a' , (3'). (21) 

Accordingly, the form of the Hamilton equations we shall use is 

#A(a) . SH 

— 1 = ^AA'YTTT^ V^^J 



and c.c. (22) describes a quantum evolution of pure states. All observables of the linar 
theory depend on \ip) and via the density matrix p = Let F and G be two such 

observables, that is F[iIj,iIj*] = F[p] and G[ilj,ip*] = G[p]. The Poisson bracket resulting 
from the Hamilton equations is 

^ ' ^ ^^'V5^^(a)5^^,(a) 6Ma)6rA'ic^)r ^ ^ 

Applying the chain rule to the components of the pure state density matrix 

PAA'{a,a') = iJjA{a)^A'i'^') (24) 



we find that 

= Iaa' L '^f ^,, PcB'h,P') , , -{F^ G)). (25) 

So long as the density matrix in is given by ( P^ the bracket is equivalent to the Poisson 
bracket (|2^) . Jordan, in a context of the Weinberg's theory and for a finite dimensional 
Hilbert space, investigated properties of the bracket ( ^5]) with p being an arbitrary density 
matrix. For reasons that will be explained below I will term such a general bracket the 
Bialynicki-Birula-Morrison- Jordan (BBMJ) bracket. 

We will now show that (^), for a general p, can be written in a form of a generalized 
Nambu bracket. Let p be arbitrary. The BBMJ bracket can be rewritten as 

{F, G} = pAA'ia, a')fi%B'cc'(«, 7, V) . ^f. . ^? . (26) 

(>pBB'{p,p') opcc'n,r) 

with 



^^%B'cc'(«,«',/3,/3',7,7') = 5^5i;W5(a,7)5(a',/505(/5,7') 

-5^5^:/cB'5(«,/3)^(a',7')^(7,/?') 
= (27) 

where, in analogy to the spinor calculus, we have clumped together the respective quadruples 
of indices into composite ones (a = {A, A', a, a'), etc.). 

The "structure kernels" satisfy conditions characteristic for Lie-algebraic structure 
constants: 

^"cb = -^\c (28) 

and 

^V^^^e + ^\c^\a + = (29) 

These two conditions imply the Jacobi identity. The composite index form of the BBMJ 
bracket 
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{F,G} = p.fiV^^ (30) 

OPb Opc 

shows that it takes the same form as the generahzed BBM-Nambu bracket written in terms 
of the Wigner function for a scalar field 0. As a matter of fact, the BBMJ bracket is simply 
a different representation of the BBM bracket. The formula (^) looks much the same as 
the Poisson bracket related to the Kiryllow form on coadjoint representations of Lie groups 
(such brackets for general structure constants are called the Lie-Poisson brackets (cf. 

ED). 

It remains to find out how to formulate the explicit triple bracket equivalent to (|30|) . 

In order to do this we first have to define a "metric tensor" to lower the upper index 
in the structure kernels (Bialynicki-Birula and Morrison avoided this difficulty because the 
field they considered had no spinor components). The apparently natural guess (the Killing 
metric) 

g^, = n\,n\c (31) 

is incorrect as (|3lD involves expressions like 5(0) which are not distributions in the Schwartz 
sense. 

The correct definitions are 

gab = -lAB'ia, (3')Iba'{I3, a') (32) 
g'^' = -uj^^\a,P')u^^\p,a'). (33) 

The metric tensor is symmetric 

gab = gba (34) 
and satisfies the invertibility conditions 

g^'gbc = gcbg""- = 5^4'^ («, 7)5(a', 7') =: 5^- (35) 

The metric tensor is a useful tool. Consider for example a p-independent Ff, = Fbb'{P, P')- 
Then 
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F[p] = g-'p^F, = p^^,{[3',a')F%,{a',P') = TrpF (36) 



and we see that linear observables can be naturally expressed with the help of (^). This 
example is important also as an illustration of the convention concerning lowering and raising 
of indices. For notice that 

5F 



although the staggering of indices like F^A'{l3',a') might seem more natural. 
The fully covariant form of the structure kernels is 

^abc = -lAB'ia,l3')IcA'il,a')lBC'il3,l') + lAC'ia,l')lBA'il3,a')IcB'il,l3'). (38) 

One easily verifies that Qabc is totally antisymmetric. 

Following Bialynicki-Birula and Morrison let us introduce the functional 

S2 = lg^'paPb = \TT{p'), (39) 

which is one half of the inverse of Renyi's 2*-entropy. 

The BBMJ bracket is now equal to the following triple bracket 

{F, G} = [F, G, S2] = fi.,e^^^. (40) 

dpa dpb dpc 

The antisymmetry of the triple bracket means that S2 is the Casimir for the BBMJ bracket 
Lie algebra of observables. Another Casimir is Tr p because {Tr p, F} = for any dif- 
ferentiable F (hence not only linear). The wave functions have been eliminated from the 
dynamical equations, but the Hilbert space background is implicitly present in the structure 
kernels and the metric tensor which are defined in terms of uj and /, and in the very notion 
of the density matrix which acts in the Hilbert space. 
Components of the pure state density matrix satisfy 

^Pa = {pa,H}. (41) 

which holds also for general density matrices as can be seen from the familiar, operator 
version of the Liouville-von Neumann equation. It follows that the density matrices form a 
Poisson manifold, as opposed to state vectors that form a phase space. 
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IV. NONLINEAR QUANTUM MECHANICS AS A GENERALIZED NAMBU 

MECHANICS 



The generalizations of quantum mechanics considered by Kibble |16] and Weinberg |T7 



are based on the Hamiltonian framework. The nonlinear evolution is introduced through 
an extension of the class of admissible Hamiltonian functions. More generally, all canonical 
transformations are generated by a larger class of functionals on Hilbert or projective spaces. 
The functionals are a generalization of averages of observable quantities. This fact leads to 
the fundamental difficulty in constructing a probability interpretation of such theories: The 
generalized observables do not form an associative algebra which makes impossible a unique 
definition of powers of observables, the formal counterpart of higher moments of random 
variables measured in experiments. 

The triple bracket form of the Liouville-von Neumann equation shows that the time 
evolution in linear QM has, in fact, two generators: the average energy (Hamiltonian 
function) and the Casimir S, which measures Renyi's a = 2 entropy (or, even more directly, 
Daroczy entropy of order 2). It is natural to ask what will be changed in the theory if, 
instead of generalizing the class of admissible Hamiltonian functions, we shall extend the 
class of entropies. A physical meaning of such an extension would be the one required by 
Wigner in his paradox of a friend: We extend quantum mechanics to systems that can gain 
information. The extended theory has a well defined probability interpretation, because the 
observables are represented by linear operators, provided the scaling by a constant, p Xp, 
is a symmetry of the dynamics. This imposes on the generalized entropies the 2-homogeneity 
condition: S{Xp) = A^S'(p). 

Only for S[p] = l/2Tr(p^), denoted later by 5*2 [p], the linear observables are closed under 
the action of the bracket {■, ■}s '■= [■, ■, S]. If we extend the class of acceptable S, we have to 
accept also a somewhat stronger form of the complementarity principle than in linear QM: 
Observables are always complementary to their time derivatives (see Sec. 0). We shall begin 
the discussion of the generalization with the question whether, for general S, the manifold 
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of states is the Poisson manifold. 



A. The Jacobi Identity 

Let F, G, H and S be arbitrary twice functionally differentiable functionals. We consider 
the expression 

J = {{F, G}s, H}, + {{H, F}s, G}s + {{G, H}s, F}, 
5F 5G 5^S 5H 5S 



6pd 6pe 6pa6pf 6pb 6pc 



i^def^abc + ^bdf^aec + ^ebf^adc) (42) 



which holds good for any S. j^j^ = 9°"^ for 8 = 82 and (^) vanishes in virtue of (^). For 
more general 5* = 8{f2[p\) we find 

2^P' (43) 



Spc df2 
5^8 d^8 , f ^d8 
SpaSpf dfi df2 



4it72PV^ + 2—^"^. (44) 



Inserting these expressions into ( |^ ) we obtain 



6F 6G d"^ 8 6H 98 

J = ^^^irnP" ^TrrP^^def^abc + ^bdf^aec + ^ebf^adc) = (45) 
OPdOPe CI J 2 Opb a/2 

since QabcP°'P'^ = 0. With this choice of 8 we obtain the dynamics given by 

±p^ = {p^,H}s.Mp] (46) 

where C[p] = 2^ = C{f2[p]) is an integral of motion, as we shall see later. The only 
difference with respect to ordinary QM would be in a p-dependent rescaling of time, a 
phenomenon that, in principle, might infiuence lifetime characteristics of physical processes. 

For more general 8 the question of the Jacobi identity is open, hence we have to accept 
the possibility that mixed states in the generalized QM do not form a Poisson manifold. This 
would not be surprising since in various versions of generalizations of the Nambu mechanics 
the Jacobi identity does not hold. 
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B. Composite Systems in the New Framework 

Let the Hilbert space in question and the density matrix of some composite system be 
= Hi ® 7^2 and 

Pa = PAA'{0i, Oc) = PAiA2A'-,A'^{otl, 0-2, a\, a'^. (47) 

The same doubhng of indices concerns 

Iaa'{oi, a') = -iSAiA^^AiA'^K'^i, 0i[)5{a2, a'2). (48) 

Reduced density matrices of the two subsystems are 

= 5^'^^5(a2, a'^)pAiA2A\A'^{<^u "2, a'l, ^2) (49) 

PMA'^i^2, 0/2) = 5^'^i5(q;i, Q;i)pAiA2A;A^(Q;i, q;2, ai, q;2) (50) 

and satisfy 

Sn '^^^'ll^'ri = «i)<^(/?2, /?^)<5(/?;, ai) (51) 

0PBiB2B[B'^[Pl,P2,Pl,P2) ^ 

and 

dpB-,B2B[B!, [Pi, P2, Pi, P2) ^ 

The structure kernels for the composite system are 

^abc ^010261 62C1C2 

= -iiSA^B^Sc^A^SBiCj^SAiB'^SCiA'^SBzC:^ X 

5(ai, /305(7i, «;)5(/3i, y,)S{a2, /3^)5(72, «^)5(/32, 72) 
- SaiC[SbiA[SciB[Sa2c;2Sb2A'^Sc2B'^ x 



5(ai, 7i)<^(/3i, «;)5(7i, P[)5{a2, ^2)^, «^)<^(72, /3^)) ■ 



(53) 



The following two results solve generally the question of faster-than-light telegraphs in both 
Hamiltonian and triple bracket frameworks. 
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Lemma 1 Reduced density matrices of the subsystems satisfy 



aj-f^^-f^ = 0. (54) 



Proof: It is sufficient to contract (Rl) with (pT]) and (|5^).D 



5p^5pg6F^6(^6S 

Spa Sp^a, spdi Sp 



Theorem 2 Let F = F[p^] and G = G[p^^], that is depend on p via (|49|) and (|50|) , then for 
any S 

{F,G}s = 0. (55) 



Proof: By virtue of the lemma one has 

= fiafec^^T^T^P = {F, G}s. (56) 



□ 

Notice that we have not assumed anything but differentiabihty not only about 5* but 
also about F and G. So, in particular, for arbitrary (nonlinear) observables and 5* = 5*2 we 
obtain the Polchinski- Jordan result for Weinberg's nonlinear QM. 



C. Density Matrix Interpretation of Solutions of the Generahzed Evolution Equation 

One of the essential questions we have to clarify concerns the density matrix interpreta- 
tion of the solutions of the generalized Liouville-von Neumann equation 

^p^ = [p^,H,S]. (57) 

There is no general a priori guarantee that the generalized dynamics will conserve positivity 
of p. The next theorems will give a partial answer to this problem. 

In order to attack the question we have to make the language of the S'-brackets more 
readable. Consider the triple bracket [F, G, H] of arbitrary functionals F, G and H. We 
find that 
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r m= 

5p^:,,(/5',a') 6p%{i,(3') 6p^'c,{a',i) 



6F 6G 8H ^^^^ 



5p^:4,(y,a') 5p\,{a',P') 5p^^,(/5',7')' 
Applying the notation of (|37|) (where now the "operator" kernels are in general p-dependent) 
we transform (|5^) into 



F^'^,{a\(3')G^'c,{(3\i)H^'Ai.<^') 

-F^'c,{<^\i)G^\,{(3\a')H^'M,(3') = Tr (59) 

In the last line we have introduced an abbreviated convention based on the assignment to 
any functional F of an operator 

which is defined by the kernel form used in (|59D . For example 



5p 

and 



P = (61) 



6p 

the latter being the shortened form of 



np"-\ (62) 



0pAA'{O!,a') 

X 5(/3;, a)6ia\ /52)5(/3^, Ps) ■ ■ ■ 5(/3;_i, /?„) 

XpB,B',{P2,P2)---PB„BdPn,P'n)- (63) 

The first of these implies the known result 

[F,G,^2] = -^Tr(p[F,G]) (64) 
leading to the von Neumann/Heisenberg equations for states/observables in linear QM 

^Tr(pF) = -zTr(p[F,^]). (65) 
17 



The same equation is valid in the Polchinski- Jordan density matrix formulation of Weinberg's 



NLQM |T8| , p!3| , but then F = F[p], etc. Consider now a functional S (differentiable in f^) 



S[p]=Sih[p],---Jn[p], 



(66) 



where fk[p] = Tr (p''). 



Theorem 3 For any m G N, and any G, if S satisfies (|66D then 



(67) 



Proof: 

[Tr(p-),G,5] =E[Tr(p'"),G,/„]— = -zm J] nTr (G[p-\ p^^-^D^f = 0. (68) 

□ This interesting result covers many nontrivial generalizations of 5*2. As a by-product it 
shows also that the same property holds for the Weinberg-Polchinski- Jordan NLQM because 
we have not assumed that G is linear in p (moreover, it includes other theories where 
observables do not satisfy any homogeneity condition). The particular case m = 1 implies 
that Tr p is conserved by all evolutions, a fact important for a definition of averages. For 
pure states Tr (p™') = (Trp)™ so that the integrals fm are not necessarily independent, but 
for all m, n fm and /„ are in involution with respect to {•, -js. Jordan proved in [|l^ by an 
explicit calculation that in his formulation of Weinberg's nonlinear QM Tr p and Tr p^ are 
conserved — our theorem considerably generalizes this result. 

Theorem 4 Let S satisfy (|6^) and pt he a self-adjoint solution of (p^) . If po is positive and 
has a finite number of nonvanishing eigenvalues Pk{0), < Pk^O) < 1, then the eigenvalues 
of Pt are integrals of motion, and the evolution conserves positivity of pt. 

Proof: Since the nonvanishing eigenvalues of po satisfy < pfe(O) < 1 < 2, it follows that 
for any a Pfc(O)" can be written in a form of a convergent Taylor series. By virtue of the 
spectral theorem the same holds for Pq and Tr (pg)- Each element of the Taylor expansion 
of Tr (pg) is proportional to /„[po], for some n. But /„[po] = fn[pt] hence 
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Tr (p^) = Tr (p^) = EPfc(O)" = E^^W" (69) 

k k 

for all real a. Since all Pk{0) are assumed to be known (the initial condition), we know 
also Z]fcPfc(0)" = J2kPk{t)°' for any a. We can now use the result used in the information 
theory stating that the knowledge of J2kPkit)" for all a uniquely determines Pkit). The 
continuity in t implies that Pk(t) = Pk{0). □ 

The spectral decomposition of the density matrix 

Pt = EPfcl^'^)(^'^l' (70) 

k 

where t ^ \k,t) defines a one-parameter continuous family of orthonormal vectors, leads 
to the unitary (although p-dependent) transformation \k,t) = U{pt, po)\k,0) . The density 
matrix evolves then as follows 

pt = U{pt,po)poU{pt,poy^- (71) 

The question whether the same holds good for po having an infinite number of nonvanishing 
eigenvalues will be left open here. In any case, it seems that the above theorem is sufficient 
at least "for all practical purposes" . 

To make our proposal a little bit more concrete we have to choose some explicit "physical" 
class of S — and here the information theoretic introduction may be helpful. 

The suggestion of Wigner that a natural arena for nonlinear generalizations of the lin- 
ear formalism of QM is the domain of observations leads to investigation of systems that 
can gain information hence are described by a 7^ 2 entropies. A homogeneity preserving 
generalization of S2 for other a-entropies can be, for instance. 



The choice of the denominator is important only from the point of view of the homogeneity 
of the evolution equation. The multiplier 1 — 1/ a guarantees that the evolution of pure states 
is the same, hence linear, for all a ( this is reasonable as pure states have the same, vanishing 



a-entropies) . The generalized Liouville-von Neumann equation following from ([7^) is 
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For pure states and Trp = 1, p" = p and the equation reduces to the ordinary, hnear 
one; for mixed states the evolution is nonhnear unless the states are "so mixed" that p is 
proportional to the unit operator (which makes sense in finite dimensional cases, of course) 
and all a-cntropics reduce to the Hartley formula. 

The evolution of (now linear) observables is governed by 

TV(p»-[F.Hl) (74) 

which shows that for the generalized S the time derivative of an observable is not linear in 
the density matrix. For a — 2 the equations reduce again to the ordinary linear equations. 
It seems that the following choice of Sa is also interesting: 

For pure states the expression reduces to the hnear form — |Tr(p^). The density 

matrix would satisfy then the equation 



^.-^ " (Tr(p"))^/^"-^^-\ ^ 



'dr'^ 2a -1 (Trp)V(«-i) 
which for pure states and normalized p would become 

and the "Boltzmann-Shannon classical limit" a — > 1 of the Renyi entropy is indistinguishable 
from the /i — > classical limit of QM. 

D. Composition Problem for Subsystems with Different Entropies 

Assuming that the formalism is applicable to a description of the composite "ob- 

ject+observer" system, where the nonlinearity is a feature of the observer, we have to know 
how to combine systems that are described by different entropies. 
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I think it is best to approach the question again in an information theoretic way. To begin 
with, let us consider a system whose entropy is /„, and whose subsystems have entropies of 
the same kind: for all k a A;-th system's entropy satisfies la^. = la- Let the k-th subsystem 
be described by a reduced density matrix pk. The entropy of the "large" system should 
be defined, as usual in information theory, as the average entropy of the subsystems. The 
overall entropy of the large systems should not depend on the way we decompose it into 
subsystems. Therefore the average cannot have the apparently natural form 

n 

/ai...a„[p] = Y^PMpk], (78) 
k 

where pk are some weights, because the LHS is sensitive to correlations between the subsys- 
tems whereas the RHS is not, so that the entropy would be sensitive to the decompositions 
which are arbitrary. It seems we have to assume that in such a case the composition takes 
the trivial form 

[p] = Pkla^ [p] = P^Ia [p] = L [p] ■ (79) 

k k 

Consider now a situation where the different subsystems have different entropies, say, /q^,. 
The average entropy of the composite system is now defined in analogy to (|75D as 



W..aM=llPkIaAP\ (80) 



where the probabilities pk are weights describing the "percentage" of each of the entropies 
in the overall entropy of the system. We do not know how to determine the weights — they 
can play a role of parameters characterizing the system. 

The above definitions imply that the a*-entropy of the large system is 



^:...ajp] = n^:jprN (si) 

oik 

so it is natural to define 

Sa,...o.Ap\ = X{SaApr- (82) 
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Denoting the latter expression by S, we obtain 

^Pb = Y.Pk[pb, H, Sa,]^. (83) 

Consider again a system which consists of subsystems equipped with the entropy of the same 
kind. Then 5*0,^. = Sai = S for all k and I and the system evolves according to 

^p, = [p,, H, S] (84) 

as expected. 

Next possibility is that the entropies that sum to the overall entropy are again sums 
of some other entropies. The description of the whole system should not depend on the 
order in which the partial entropies are summed up. So consider two entropies and S^^ , 
with appropriate weights A"^ and A"^^, and let the entropies and S^^ consist of some other 
entropies Sl and Sl^ appearing with weights {p{}^^i and {p{^}f£i, respectively. Then 

j^Pb = E ^'piiPb. H}si§T + E A V{P^., H}s-^ (85) 

k k I I 

which shows that the evolution can be indeed consistently composed of "sub-entropies" . 

If all Sa^[p\ depend on p only via /m[p], like in our definitions ([7^) and ([76|), we know 
that on general grounds they are integrals of motion. Consider a subsystem described by pk 
and which is noninteracting with the subsystem "where the nonlinearity resides" . In such a 
case the overall Hamiltonian function is H[p\ = J2kHk[pk] and 

d S 

-i-pkb = Y.Pi[pkb,Hk[pk],Sc,^]—. (86) 

A system described by pk and S'q,^ can be totally isolated from the "rest of the Universe", 
if for k I, [pkb, Hk[pk], Sai] = and pk = Sa^/S. However, even in such a case the global 
properties of the large system leave their mark on the local properties of all the subsystems 
as Pk S/ Sai, is at most an integral of motion hence depends on initial conditions. Consider 
a general H (including the interaction) and let 

S[p] = Sa,....M = I[SadPkr. (87) 
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where the different subsystems have different entropies. Each of the sub-entropies satisfies 

^S^k W] =T.Pi [Sck [Pk] , H, S^, {pi)]-^ = (88) 

in virtue of the theorem |^ (we have used here the fact that {F, H}s = —{F, S}h), and the 
antisymmetry of the triple bracket. Therefore not only the overall entropy, but also the sub- 
entropies are integrals of motion even if the Hamiltonian function H contains interaction 
terms. 

Consider now the reduced density matrix pk of one of the subsystems. Using the same 
theorem we find that 

^Pkb = Pk[pkb, H, Sa^]-^. (89) 

where S/ Sa^. is an integral of motion but its value depends on initial conditions. If the change 
of the initial conditions does not affect the reduced density matrices in (pTj), the integral 
of motion is also unchanged. Therefore in order to change this quantity we have to change 
correlations between the subsystems. In particular, a time dependence of a linear system 
which is noninteracting with the nonlinear one is insensitive to changes of initial conditions 
within the linear system if the particular form ( ^7|) holds. For global entropies different 
from (|87| ) some kind of sensitivity appears but the influences between the subsystems cannot 
propagate faster than light unless we introduce the projection postulate. 

Such a trace of nonlinearity observed in some linear system might be used to detect the 
nonlinearity. Following Santilli [jl9[ we can expect that an evolution of an internal part of a 
hadron may be nonliner (like in hadronic mechanics). In such a case correlations between 
a hadron (say, a proton) and some linear system (say, an electron) could be observed in a 
form of a p-dependent rescaling of time in the electron's evolution. 



V. COMMENTS 



S. Weinberg wrote in PU| that the "theoretical failure to find a plausible alternative to 
quantum mechanics, even more than the precise experimental verification of linearity, sug- 
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gests (...) that quantum mechanics is the way it is because any small change in quantum 
mechanics would lead to logical absurdities. If this is true, quantum mechanics may be 
a permanent part of physics. Indeed, quantum mechanics may survive not merely as an 
approximation to a deeper truth, (...) but as a precisely valid feature of the final theory." 
This kind of conviction followed from the internal theoretical difficulties of the generaliza- 
tions based on nonlinear Schrodinger equations and general Hamiltonian framework. These 
difficulties have been discussed in detail in [Q. 

The proposal based on the generalized Nambu dynamics is free from those difficulties. 
However, it has some new features with respect to ordinary QM. One of them, the stronger 
complementarity principle, has been already announced. In the generalized framework a 
time derivative of an observable will not, in general, be linear in the density matrix. Since 
we have defined observables as functions necessarily linear in p, the time derivatives of 
observables are not themselves observables. 

I propose the following interpretation of this fact. To fix our attention let us consider 
linear QM and the nonrelativistic position operator. An average velocity of an ensemble of 
particles can be calculated either by first calculating an average position and then taking 
its time derivative, or by first measuring the velocity of each single particle and then taking 
the average. We can say that the first procedure is a calculation of the time derivative of an 
average, whereas the latter is taking the average of the time derivative. The situation can 
be described symbolically by the equation 

An important property of QM is the impossibility of realizing the two procedures simulta- 
neously, as V = p/m and q are complementary. It follows that in a concrete experiment 
we have to decide which way of measuring to choose. In this meaning if we can measure q, 
we cannot measure ^g, and vice versa. To express it differently, if q is observable (not an 
observable!) then ^qis not. 

In triple bracket NLQM the observables will be defined as quantities that are in one- 
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to-one relationship to some experimentally measured random variables (hence the linearity 
in p). Two observables will be said to be complementary if there does not exist a physical 
situation where the two respective random variables can be measured simultaneously, that 
is in a single run of an experiment. There can exist linear operators representing the po- 
sition and the velocity of single members of an ensemble, but if the ensemble evolves in a 
nonlinear way the averages of those observables do not have to satisfy the inherently linear 
condition (|90|) , if the experimental procedures necessary for their measurements cannot be 
simultaneously realized. 

Another fundamental problem, arising in the Nambu-like description, is the action prin- 
ciple leading to the triple bracket equation. The Hamiltonian NLQM proposed by Kibble 
or Weinberg can be derived from the ordinary Lagrangian formalism. The triple bracket 
form of dynamics must follow from a new kind of variational principle. The variational 



principle proposed recently by Takhtajan suggests an interesting direction for further 
investigations. 



VI. APPENDIX: HAMILTONIAN FORMULATION OF THE DIRAC EQUATION 

Consider the Dirac equation 

{il^^a - m)i) = (91) 

where Va = da + ie$a and is an electromagnetic potential world-vector. We are going 
to rewrite the equation in a form of the "proper-time" covariant Hamilton equations of 
motion. The "proper time" will be defined in terms of spacelike hyperplanes constructed 
as follows. Let crT-{x{T)) = be an equation defining a family of spacelike hyperplanes. 
The field of timelike, future-pointing, normalized vectors n^(x) oc d°'o'r{x), satisfying the 
continuity equation dan'^{x) = 0, defines the field of "proper time" directions. Integral curves 
T 1-^ x"(r) of n^{x), where r is the parameter of the family {(Jr}, play the role of the world- 
lines. We shall need the continuity equation to guarantee the reality of the Hamiltonian 
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function. Notice that this condition ehminates some physically meaningful hyperplanes, like 
the proper-time hyperboloid cr^-lx) = x°'Xa — = 0, but admits simultaneity hyperplanes 
(Tt-(x) = n°'Xa — r = 0. The "proper time" following from the construction should not, for 
this reason, be identified with the ordinary proper time of the electron. The "proper time" 
derivative at x is defined as 

^ = <i^)da. (92) 



Multiplying (|9T|) from left by the Dirac matrices we obtain 



(iV + (x'^^Vb - mj'')ip = 0. (93) 
Writing the four-potential explicitly in 

id^-ijj = {-a'^^db + + iea^^^b + 7717'^)^^ (94) 
and contracting with n^(x) we get 

i^V^(a;) = (~(JabK{x)^ + e<(x)$a(a;) + iea„f,<(x)$''(x) 



+ mn"(x)7aj?/'(x) (95) 

= Hi){x). (96) 

In the spinor language 

^V^^'ct>A = ^^X^' =ig,''^'V'^ct>A (97) 

i^AA'X^' = -l^(t>A = WaAA'^'X^' (98) 

where /i = m/ \/2 and are the Infeld-van der Waerden symbols . Using the identities 



fxA'9'^^^9\A<9''''^=9''W (99) 

9^XA<9'^^ -9\a<9'''^ =^o^'I (100) 

„a J>AY' J) „aAY' A^ab Y' /'I ni ^ 

9 AX'9 - 9 AX'9 = 4a ^, (101) 

where cr°*x^ ^^'^ ^"'^ x^' generators of (^,0) and (0, |) representations of 5*^(2,0) we 
obtain 
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«Va0x = -4:ia^f^x^V^(j)Y + 2figaxx'X 



(102) 



zVax""' = M^aw^'^'x'^' - 2/i^7/^'0x (103) 

and the equations obtained by their complex conjugation. These equations are especially 
simple if we express generators and Infeld-van der Waerden symbols in purely spinorial terms. 
Remembering that = 1 implies uaa't n^"^' = ^^^^ we get after some calculations 

i-^<Px{x) = in^^'{x)VxY'<pY{x) + l^n-rxx'{x)x^' (x) 
dr 

+ e<(x)<l>a(x)0x(a;) (104) 

+ e<(x)<l>,(x)x^'(s) (105) 

Let dar{x) be some invariant measure on the hyperplane ar- The equations can be derived 
from the Hamiltonian function 

H[i;,r] = {m\^) 

{^0;,,(x)nf^'(x)nr'(x)Vxy'0y(a:) 
-iX*''ix)n,xx'ix)nryY'ix)V^'''x^'ix) (106) 

+ ^fi {<P*x,{x)x''' (x) + x*''{x)Mx)) 

+ e<(a;)$a(x)nf^'(x)(0x(a;)(/)x'(a;) + X*xix)Xx'ix))}darix) 



provided 



urxx'ix) = rirxvix) = (107) 



and the wave functions vanish at boundaries of the hyperplane a,-. Reality of H is guar- 
anteed by the same conditions. The Hamiltonian function is not positive definite, which is 



correct since we are working here in first quantized formalism. Contraction of (|107| ) over 
the remaining indices implies the continuity equation discussed above. 
The explicit form of the Hamilton equations is 
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,:„;-'wAfcW = -^, (108) 

in.xAx)-X (^) = J^y (109) 

and c.c, or, in the Poissonian way, 

d , , , . 6H . 

,-M,) = 2n^;<,,.(x)j^y (110) 

We can see that irirxx'ix) = ujrxx'ix) are the components of the symplectic (since derivable 
from a Kahler potential || ip |p) form on 0"^ at point x & ar, and the Poissonian form 
Irxx'ix) = -2inrxx'{x). 

Let 7"^, a = 0, 1, 2, 3, be the Dirac matrices. The Hamilton equations equivalent to the 
Dirac equation written in the bispinor form are 

><(.)7r^*W^^. (112) 

and c.c, where * denotes the complex conjugation. The formulas are simplest if we take 
simultaneity hyperplanes foliation of the Minkowski space. Then ?2"(a;)7"^ = 70^. Denoting 
its inverse by 700-/3 we find that 

^^^x) = -70.,^ (113) 
d 6H 

So here the Poissonian form is lap = —i'joafs and the Dirac matrix •yoa/s corresponds to 6ab' 
discussed in |Tl|. The transition to the triple bracket formalism is now straightforward. 
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